Improving TTS quality using pitch contour information of source speaker in S2ST framework

Authors

  • Pablo Daniel Agüero
  • Jordi Adell
  • Antonio Bonafonte
Abstract

Intonation is one of the most important components of human spoken communication. Correct prosody in a text-to-speech system therefore contributes to better quality in terms of intelligibility, naturalness and pleasantness. To achieve it, paralinguistic information must be obtained from other sources to complement the information in the text, because natural language understanding algorithms have limitations that do not allow them to interpret the meaning of a text or to hold an opinion. In this paper we propose using the intonation of the source-language speaker to extract semantic information in the framework of speech-to-speech translation (S2ST), which makes it possible to improve the quality of the intonation in the target language. The proposed approach tries to find intonation patterns that correlate between languages, so that, given a pitch movement in the source language, we may infer the pitch movement in the target language. This additional information source can be used as an input feature for the intonation model of the text-to-speech module of the speech-to-speech translation system. Experimental results support the proposed approach, with an increase in the mean opinion score (MOS) given by evaluators (3.1→3.6 and 3.3→3.7 on a five-point scale in the task of translation between Catalan and Spanish).

Similar resources

Using Exciting and Spectral Envelope Information and Matrix Quantization for Improvement of the Speaker Verification Systems

Speaker verification from speaking a few words or sentences has many applications. Many methods, such as DTW, HMM, VQ and MQ, can be used for speaker verification. We applied MQ for its precise, reliable and robust performance combined with computational simplicity. We also used pitch frequency and the log gain contour to further improve system performance.

Non-linear Pitch Modification in Voice Conversion Using Artificial Neural Networks

The majority of current voice conversion methods do not focus on modelling local variations of the pitch contour, but only on linear modification of the pitch values, based on means and standard deviations. However, a significant amount of speaker-related information is also present in the pitch contour. In this paper we propose a non-linear pitch modification method for mapping the pitch contours ...

Transforming Pitch in a Voice Conversion Framework

A subtask of voice conversion is to accurately map the pitch contour of a source speaker to a target speaker. So far, the most widely employed method for carrying out this mapping is based on adjusting the pitch range of the source speaker to match the target while keeping the shape of the contour unchanged. In this project, we investigate four alternative algorithms for pitch contour mapping a...

Japanese pitch conversion for voice morphing based on differential modeling

In this paper, we convert the pitch contours predicted by a TTS system that models a source speaker to resemble the pitch contours of a target speaker. When the speaking styles of the speakers are very different, complex conversions such as adding or deleting pitch peaks may be required. Our method does the conversions by modeling the direct pitch features and differential pitch features at the...

Publication date: 2005